Kharosthi

Kharoṣṭhī
Type	Abugida
Languages	Gandhari Prakrit Tocharian Kuchean
Time period	4th century BCE - 3rd century CE
Parent systems	Proto-Sinaitic alphabet Phoenician alphabet Aramaic alphabet Kharoṣṭhī
Sister systems	Brāhmī Nabataean Syriac Palmyrenean Mandaic Pahlavi Sogdian
ISO 15924	`Khar, 305`
Direction	Right-to-left
Unicode alias	Kharoshthi
Unicode range	U+10A00–U+10A5F

The Kharoṣṭhī script is an ancient abugida (or "alphasyllabary") used by the Gandhara culture of ancient South Asia to write the Gāndhārī and Sanskrit languages. It was in use from the middle of the 3rd century BCE until it died out in its homeland around the 3rd century CE. It was also in use in Kushan, Sogdiana (see Issyk kurgan) and along the Silk Road, where there is some evidence it may have survived until the 7th century in the remote way stations of Khotan and Niya. Kharoṣṭhī is encoded in the Unicode range U+10A00–U+10A5F, from version 4.1.0.

Form

Kharoṣṭhī is mostly written right to left (type A), but some inscriptions (type B) already show the left to right direction that was to become universal for the later South Asian scripts.

Each syllable includes the short a sound by default, with other vowels being indicated by diacritic marks. Recent epigraphical evidence highlighted by Professor Richard Salomon of the University of Washington has shown that the order of letters in the Kharoṣṭhī script follows what has become known as the Arapacana Alphabet. As preserved in Sanskrit documents the alphabet runs:

a ra pa ca na la da ba ḍa ṣa va ta ya ṣṭa ka sa ma ga stha ja śva dha śa kha kṣa sta jñā rtha (or ha) bha cha sma hva tsa gha ṭha ṇa pha ska ysa śca ṭa ḍha

Some variations in both the number and order of syllables occur in extant texts.

Kharoṣṭhī includes only one standalone vowel sign, a, which is used for initial vowels in words. Other initial vowels use the a character modified by diacritics. Using epigraphic evidence, Salomon has established that the vowel order is a e i o u, rather than the usual vowel order for Indic scripts, a i u e o. This is the same as the Semitic vowel order. There is also no differentiation between long and short vowels in Kharoṣṭhī: both are marked using the same vowel markers.
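The inherent short a and the vowel diacritics described above can be sketched in Python using code points from the script's Unicode block. This is a minimal illustration rather than a transliteration scheme: the function name `syllable` and the small romanization tables are assumptions made for the example, and only three consonants are included.

```python
# Code points taken from the Kharoshthi Unicode block (U+10A00-U+10A5F).
LETTER_A = "\U00010A00"  # the one standalone vowel letter

# Dependent vowel signs; short "a" is inherent and needs no sign.
VOWEL_SIGNS = {
    "a": "",
    "i": "\U00010A01",
    "u": "\U00010A02",
    "e": "\U00010A05",
    "o": "\U00010A06",
}

# A few consonant letters, keyed by an ad hoc romanization (illustrative only).
CONSONANTS = {
    "k": "\U00010A10",
    "kh": "\U00010A11",
    "g": "\U00010A12",
}


def syllable(consonant: str, vowel: str) -> str:
    """Return the code point sequence for one syllable.

    An empty consonant means a word-initial vowel, which is written as
    the letter A carrying the vowel sign.
    """
    base = CONSONANTS[consonant] if consonant else LETTER_A
    return base + VOWEL_SIGNS[vowel]
```

Here `syllable("k", "a")` is the bare consonant letter, while `syllable("", "i")` is the letter A plus the i sign. The same sign would serve for a long vowel, since the script does not mark vowel length.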

The alphabet was used by Buddhists as a mnemonic for remembering a series of verses relating to the nature of phenomena. In Tantric Buddhism this list was incorporated into ritual practices, and later became enshrined in mantras.

Alphabet

a	i	u	e	o	ṛ

k	kh	g	gh
c	ch	j		ñ
ṭ	ṭh	ḍ	ḍh	ṇ
t	th	d	dh	n
p	ph	b	bh	m
y	r	l	v
ś	ṣ	s	h

ḱ	ṭ́h

Numerals

Kharoṣṭhī numerals
۱	۲	۳	ㄨ	۱ㄨ	۲ㄨ	۳ㄨ	ㄨㄨ	۱ㄨㄨ
1	2	3	4	5	6	7	8	9

੭	Ȝ	੭Ȝ	ȜȜ	੭ȜȜ	ȜȜȜ	੭ȜȜȜ
10	20	30	40	50	60	70

ʎ۱	ʎ۲
100	200

Kharoṣṭhī included a set of numerals that are reminiscent of Roman numerals. The symbols were I for the unit, X for four (perhaps representative of four lines or directions), ੭ for ten (doubled for twenty), and ʎ for the hundreds multiplier. The system is based on an additive and a multiplicative principle, but does not have the subtractive feature used in the Roman number system.^[1]

1	2	3	4	10	20	100	1000

Note that the table beside reads right-to-left, just like the Kharoṣṭhī abugida itself and the displayed numbers.

The numerals are encoded by Unicode at codepoints U+10A40 to U+10A47:

10A40 𐩀 One	10A41 𐩁 Two	10A42 𐩂 Three	10A43 𐩃 Four	10A44 𐩄 Ten	10A45 𐩅 Twenty	10A46 𐩆 One Hundred	10A47 𐩇 One Thousand
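The additive and multiplicative principle described above can be sketched in Python as follows. The sketch assumes the sign order shown in the numeral table (read right to left, larger values first, with a unit count preceding the hundred sign) and is limited to 1 through 999, because the text does not show how thousands combine. The function names are hypothetical.

```python
# Code points for the eight numeral signs, as listed above.
SIGN_CODEPOINTS = {
    1: 0x10A40, 2: 0x10A41, 3: 0x10A42, 4: 0x10A43,
    10: 0x10A44, 20: 0x10A45, 100: 0x10A46, 1000: 0x10A47,
}


def _units(d: int) -> list[int]:
    """Write 0-9 additively: as many 4s as fit, then a 1, 2 or 3 sign."""
    return [4] * (d // 4) + ([d % 4] if d % 4 else [])


def kharosthi_signs(n: int) -> list[int]:
    """Decompose n (1-999) into sign values in reading order."""
    if not 1 <= n <= 999:
        raise ValueError("this sketch only covers 1 to 999")
    hundreds, rest = divmod(n, 100)
    tens, ones = divmod(rest, 10)
    signs = []
    if hundreds:
        # Multiplicative step: a unit count followed by the hundred sign.
        signs += _units(hundreds) + [100]
    # Additive steps: twenties, at most one ten, then the units.
    signs += [20] * (tens // 2) + [10] * (tens % 2)
    signs += _units(ones)
    return signs


def kharosthi_numeral(n: int) -> str:
    """Return the signs as a string in logical (reading) order."""
    return "".join(chr(SIGN_CODEPOINTS[v]) for v in kharosthi_signs(n))
```

For example, `kharosthi_signs(9)` gives `[4, 4, 1]` and `kharosthi_signs(270)` gives `[2, 100, 20, 20, 20, 10]`. Because the string is stored in reading order, a renderer with bidirectional support places the first sign at the right.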

History

The Kharoṣṭhī script was deciphered by James Prinsep (1799–1840), using the bilingual coins of the Indo-Greeks (obverse in Greek, reverse in Pāli, using the Kharoṣṭhī script). This in turn led to the reading of the Edicts of Ashoka, some of which, from the northwest of the Indian subcontinent, were written in the Kharoṣṭhī script.

Scholars are not in agreement as to whether the Kharoṣṭhī script evolved gradually or was the deliberate work of a single inventor. An analysis of the script forms shows a clear dependency on the Aramaic alphabet, but with extensive modifications to support the sounds found in Indic languages. One model is that the Aramaic script arrived with the Achaemenid conquest of the region of northwest India in 500 BCE and evolved over the next 200 or more years to reach its final form by the 3rd century BCE, where it appears in some of the Edicts of Ashoka found in the northwestern part of the Indian subcontinent. However, no intermediate forms have yet been found to confirm this evolutionary model, and rock and coin inscriptions from the 3rd century BCE onward show a unified and standard form.

The study of the Kharoṣṭhī script was recently invigorated by the discovery of the Gandharan Buddhist Texts, a set of birch-bark manuscripts written in Kharoṣṭhī, discovered near the Afghan city of Hadda, just west of the Khyber Pass in modern Pakistan. The manuscripts were donated to the British Library in 1994. The entire set of manuscripts is dated to the 1st century CE, making them the oldest Buddhist manuscripts yet discovered.

Tocharian languages

In the early 20th century, inscriptions and documents in two new related (but mutually unintelligible) languages, written in Brahmi script, were discovered at various sites in the Tarim Basin. It was soon found that they belonged to the Indo-European family of languages. Our only records of the now-extinct "Tokharian A" (from the region of Turfan and Karashahr) and "Tokharian B" (mainly from the region of Kucha, but also found elsewhere) are of relatively late date, from the 6th to 8th century CE, when written records appear, but it is likely that the languages arrived in the region much earlier. Scholars are still trying to piece together a fuller picture of these languages, their origins, history and connections.^[2]

Unicode

Kharosthi was added to the Unicode Standard in March 2005 with the release of version 4.1.

The Unicode block for Kharosthi is U+10A00–U+10A5F:

Kharoshthi^[1] Unicode.org chart (PDF)
	0	1	2	3	4	5	6	7	8	9	A	B	C	D	E	F
U+10A0x	𐨀	𐨁	𐨂	𐨃		𐨅	𐨆						𐨌	𐨍	𐨎	𐨏
U+10A1x	𐨐	𐨑	𐨒	𐨓		𐨕	𐨖	𐨗		𐨙	𐨚	𐨛	𐨜	𐨝	𐨞	𐨟
U+10A2x	𐨠	𐨡	𐨢	𐨣	𐨤	𐨥	𐨦	𐨧	𐨨	𐨩	𐨪	𐨫	𐨬	𐨭	𐨮	𐨯
U+10A3x	𐨰	𐨱	𐨲	𐨳					𐨸	𐨹	𐨺					𐨿
U+10A4x	𐩀	𐩁	𐩂	𐩃	𐩄	𐩅	𐩆	𐩇
U+10A5x	𐩐	𐩑	𐩒	𐩓	𐩔	𐩕	𐩖	𐩗	𐩘
Notes 1.^ As of Unicode version 6.0
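
A minimal Python sketch of working with this block: it tests whether a character falls in the range given above and uses the standard `unicodedata` module to list the positions that the running interpreter's Unicode database has assigned. The helper names are assumptions for illustration, and the number of assigned code points depends on the Unicode version bundled with the interpreter.

```python
import unicodedata

BLOCK_START, BLOCK_END = 0x10A00, 0x10A5F


def in_kharosthi_block(ch: str) -> bool:
    """True if the single character ch lies in U+10A00-U+10A5F."""
    return BLOCK_START <= ord(ch) <= BLOCK_END


def assigned_code_points() -> list[tuple[int, str]]:
    """List (code point, character name) for assigned positions in the block."""
    result = []
    for cp in range(BLOCK_START, BLOCK_END + 1):
        name = unicodedata.name(chr(cp), "")  # "" for unassigned positions
        if name:
            result.append((cp, name))
    return result


for cp, name in assigned_code_points()[:4]:
    # bidirectional() reports the character's bidi class from the Unicode database.
    print(f"U+{cp:04X} {name} bidi={unicodedata.bidirectional(chr(cp))}")
```

The gaps in the chart above correspond to positions for which `unicodedata.name` has no entry, so they are skipped by `assigned_code_points`.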

References

^ Graham Flegg, Numbers: Their History and Meaning, Courier Dover Publications, 2002, ISBN 9780486421650, p. 67f.
^ The Tarim Mummies: Ancient China and the Mystery of the Earliest Peoples from the West, pp. 270-296, 333-334. (2000). J. P. Mallory and Victor H. Mair. Thames & Hudson, London. ISBN 0-500-05101-1.

Dani, Ahmad Hassan. Kharoshthi Primer, Lahore Museum Publication Series - 16, Lahore, 1979
Falk, Harry. Schrift im alten Indien: Ein Forschungsbericht mit Anmerkungen, Gunter Narr Verlag, 1993 (in German)
Fussman, Gérard. Les premiers systèmes d'écriture en Inde, in Annuaire du Collège de France 1988-1989 (in French)
Hinüber, Oscar von. Der Beginn der Schrift und frühe Schriftlichkeit in Indien, Franz Steiner Verlag, 1990 (in German)
Nasim Khan, M. Kharoshthi Manuscripts from Gandhara (2nd ed.): 2009. First published in 2008.
Norman, Kenneth R. The Development of Writing in India and its Effect upon the Pâli Canon, in Wiener Zeitschrift für die Kunde Südasiens (36), 1993
Salomon, Richard. New evidence for a Gāndhārī origin of the arapacana syllabary. Journal of the American Oriental Society. Apr-Jun 1990, Vol. 110 (2), pp. 255-273.
Salomon, Richard. An additional note on arapacana. Journal of the American Oriental Society. 1993, Vol. 113 (2), pp. 275-276.
Salomon, Richard. Kharoṣṭhī syllables used as location markers in Gāndhāran stūpa architecture. Pierfrancesco Callieri, ed., Architetti, Capomastri, Artigiani: L’organizzazione dei cantieri e della produzione artistica nell’asia ellenistica. Studi offerti a Domenico Faccenna nel suo ottantesimo compleanno. (Serie Orientale Rome 100; Rome: Istituto Italiano per l’Africa e l’Oriente, 2006), pp. 181–224.

External links

List of all known Kharoṣṭhī (Gandhārī) inscriptions.
Information on the Kharoṣṭhī alphabet by Omniglot
A Preliminary Study of Kharoṣṭhī Manuscript Paleography by Andrew Glass, University of Washington (2000)
On The Origin Of The Early Indian Scripts: A Review Article by Richard Salomon, University of Washington (via archive.org)
Proposal to encode Kharoṣṭhī in Unicode (includes good background info)
